On high dimensional data spaces
نویسنده
چکیده
Data mining applications usually encounter high dimensional data spaces. Most of these dimensions contain ‘uninteresting’ data, which would not only be of little value in terms of discovery of any rules or patterns, but have been shown to mislead some classification algorithms. Since, the computational effort increases very significantly (usually exponentially) in the presence of a large number of attributes, it is highly desirable that all irrelevant attributes be weeded out at an early stage. Often, patterns of interest are embedded in lower dimensional subspaces of data. If the data space S has k attributes G {al, a2...a~), then a n-dimensional subspace s. of the data space S can be formed by selecting a combination of n attributes from the set {al,az...ak), where n < k. It is usual to tackle this problem by getting some attributes and subspaces identified by the user (or domain experts). For even moderately large number of attributes, the number of possible subspaces is so large, that it is quite unlikely that the ‘experts’ would be able to identify all the ‘interesting’ subspaces.
منابع مشابه
یک روش مبتنی بر خوشهبندی سلسلهمراتبی تقسیمکننده جهت شاخصگذاری اطلاعات تصویری
It is conventional to use multi-dimensional indexing structures to accelerate search operations in content-based image retrieval systems. Many efforts have been done in order to develop multi-dimensional indexing structures so far. In most practical applications of image retrieval, high-dimensional feature vectors are required, but current multi-dimensional indexing structures lose their effici...
متن کاملImproving Visualization of High-Dimensional Music Similarity Spaces
Visualizations of music databases are a popular form of interface allowing intuitive exploration of music catalogs. They are often based on lower dimensional projections of high dimensional music similarity spaces. Such similarity spaces have already been shown to be negatively impacted by so-called hubs and anti-hubs. These are points that appear very close or very far to many other data point...
متن کاملApplication of Architectural Visual Documents and Oral History of in the Representation of Micro-Spaces and Three-Dimensional (3D) Modeling of Nawab Razavi Historical House in Yazd
Over the time, various factors have led to damage the Iranian houses. By examining the surviving documents of Nawab Razavichr('39')s house in Yazd, it is possible to represent a major part of the lost spaces and also to minimize speculation in the restoration of this historic house. The basic belief of this research is that the studies of the oral history of architecture as well as the existing...
متن کاملA Geometry Preserving Kernel over Riemannian Manifolds
Abstract- Kernel trick and projection to tangent spaces are two choices for linearizing the data points lying on Riemannian manifolds. These approaches are used to provide the prerequisites for applying standard machine learning methods on Riemannian manifolds. Classical kernels implicitly project data to high dimensional feature space without considering the intrinsic geometry of data points. ...
متن کاملAn extension theorem for finite positive measures on surfaces of finite dimensional unit balls in Hilbert spaces
A consistency criteria is given for a certain class of finite positive measures on the surfaces of the finite dimensional unit balls in a real separable Hilbert space. It is proved, through a Kolmogorov type existence theorem, that the class induces a unique positive measure on the surface of the unit ball in the Hilbert space. As an application, this will naturally accomplish the work of Kante...
متن کاملOn 5-dimensional 2-step homogeneous randers nilmanifolds of Douglas type
In this paper we first obtain the non-Riemannian Randers metrics of Douglas type on two-step homogeneous nilmanifolds of dimension five. Then we explicitly give the flag curvature formulae and the $S$-curvature formulae for the Randers metrics of Douglas type on these spaces. Moreover, we prove that the only simply connected five-dimensional two-step homogeneous Randers nilmanifolds of D...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003